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Abstract 

We transfer the celebrating Monge-Kontorovich problem in a bounded do- 
main of Euclidean plane into a Dirichlet boundary problem associated to a 
quasi-linear elliptic equation with 0— order term missing in its diffusion coeffi- 
cients: 

A(x, F' X )F'^ + B{y, F' y )F'^ = C(x, y, F' x , F' y ) 

where A(., .) > 0, B(., .) > and C are functions based on the initial distribu- 
tions, F is an unknown probability distribution function and therefore closed 
the former problem. 



The mass transport problem was first formulated by Monge in 1781, and con- 
cerned finding the optimal way, in the sense of minimal transportation cost of moving 
a pile of soil from one site to another. This problem was given a modern formula- 
tion in the work of Kantorovitch and so is now known as the Monge-Kontorovich 
problem. 

This type of problem has appeared in economics, automatic control, transporta- 
tion, fluid dynamics, statistical physics, shape optimization, expert system, meteo- 
rology and financial mathematics. For example, for the general tracking problem, a 
robust and reliable object and shape recognition system is of major importance. A 
key way to carry this out is via template matching, which is the matching of some 
some object to another within a given catalogue of objects. Typically, the match 
will not be exact and hence some criterion is necessary to measure the "goodness of 
fit". 

Many mathematicians from different fields are interested in Monge-Kontorovich 
problem. This classical problem was revived in the mid eighties by the work of 
Y.Brenier([6], [7]), who characterized the optimal transfer plans in terms of gradi- 
ents of convex functions. In the last decades, this problem has been recovered to 
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have a close relationship with certain evolutionary PDE's, which can be interpreted 
as gradient flows of certain entropy functionals with respect to a metric (which is 
well-known to probabilists, see Knott-Smitt [2D] and Rachev-Rueschendorf |26| ) in- 
volving optimal transportation called Wasserstein metric. The first application to 
mathematical physics (kinetic models) is due to Tanaka, in the seventies. In the 
early nineties the use of entropy functionals as a tool to prove convergence to equi- 
librium received a strong impulse due to the work of Cercignani, Carlen, Carvalho, 
Pulvirenti, Desvillettes, Toscani, Villani and others. Moreover, Toscani proved that 
similar methods could be used to prove optimal convergence to similarity for dif- 
fusion equations. At the same time Jordan, Kinderlehrer and Otto [T7] discovered 
that the Fokker Planck equation can be solved by a steepest descent method in- 
volving a logarithmic entropy functional and the Wasserstein distance. This work 
marks the beginning of the modern gradient flow theory on Wasserstein spaces. Af- 
ter a few years, Arnold, Carrillo, Del Pino, Dolbeault, Jiingel, Markowich, Toscani 
and Unterreiter established the link between convergence to equilibrium for linear 
and nonlinear Fokker-Planck type equations and logarithmic Sobolev inequalities, 
by developing a previous idea of Bakry and Emery [3] (with applications to the 
Porous medium equation). The key ingredients of this theory (the log-Sobolev and 
the Csiszar-Kullback inequalities) are related to certain Gaussian isoperimetric in- 
equalities (see e.g. Talagrand and Otto- Villani) . Otto realized simultaneously that 
nonlinear diffusion equations can be seen as gradient flows in the 2- Wasserstein space 
of probability measures of a free energy functional. This metric structure has been 
made rigorous by Ambrosio-Gigli-Savar. At the same time, Carrillo-McCann- Villani 
applied these ideas to granular media models producing these arguments in smooth 
settings. A basic ingredient of this theory is the notion of convexity along geodesies 
in the Wasserstein space introduced by McCann, also called displacement convexity. 

Another striking application of the optimal transportation (from the probabilis- 
tic point of view based on martingales theory) is the justification of the mean field 
limits of certain stochastic particle models by means of the theory of concentra- 
tion inequalities developed (among the others) by Levy, Gromov, Milman, Bobkov, 
Ledoux, Malrieu. A computational method for finding entropy functionals for evo- 
lutionary equations has been recently proposed by Juengel-Mattes. The use of the 
Wasserstein distance has been also extended to scalar conservation laws (Bolley- 
Brenier-Loeper, Carrillo-Di Francesco-Lattanzio) . The use of these ideas to study 
the long-time asymptotics of dissipative homogeneous kinetic models is based on 
the almost equivalence of the Euclidean transportation metric with Fourier-based 
metrics and on the basic mechanism of contraction of probability metrics (Gabetta- 
Toscani-Wennberg, Bisi-Carrillo- Toscani, Bolley-Carrillo, Carrillo- Toscani) . Several 
(important) authors have been involved in literature of the optimal transporta- 
tion theory, with remarkable applications, we mention here Caffarelli [8j[9j[llJ, 
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Ledoux j2TJ , Evans and Gangbo [H] , Carlen and Gangbo [12] . We recommend Caf- 
farelli's address to ICM2002 [HJ and Trudinger's invited lecture to ICM2006 [30] 
and L.C.Evans and W.Gangbo's paper [13] for major references from PDE point of 
view. We also recommend S.T.Rachev and L.Riischendorf's book [26] for a major 
reference from probability point of view. 

There are several formulation of Monge-Kontorovich problem. We are going to 
use its formulation in terms of probability theory, which is to find the Kantorovich- 
Rubinstein-Wasserstein distance in the plane. Suppose that we are given two prob- 
ability distributions P and P on R 2 . A 4— dimensional random vector (X, X) with 
P and P as the marginal distributions is called a coupling of this pair (P, P). 
The minimum of the coupling distance \\X — X\\l 2 among all such possible cou- 
plings is called Kantorovich-Rubinstein-Wasserstein L 2 — distance between P and 
P. From weak convergence theory, it is easy to see the existence of this optimal 
coupling (X, X). The problem is to find a concrete way to get them. It has im- 
portant applications in both probability theory and mass transfer problems. How- 
ever, the problem has been only completely solved in one dimensional case. In R , 
Kantorovich- Rubinstein- Wasserstein L 2 — distance is just given by |32j 



where F and F are distribution functions of P and P respectively, F l {t) and 
F~ (0 < t < 1) are their right inverses. 

Without losing generality, we may just consider two probability measures P and 
Q on [0, 1] x [0, 1]. Let X and Y be two random vectors defined on a same probability 
space with P and P as their individual laws. Denote 



and denote by P its probability distribution which is on [1, 2] x [1, 2]. Then 



which gives the relation between Kantorovich- Rubinstein- Wasserstein Li— distance 



of (P, Q) and that of (P, P). Since -E[X{\ + E[Yx] - E[X 2 ] + E[Y 2 ] is given, it is 




(0.1) 



X = (X 1 ,X 2 ) = (Y 1 + 1,Y 2 + 1) 



E[\X! - X x \ 2 + \X 2 - X 2 \ 2 } 
£[|Xi-yi-l| 2 + \X 2 -Y 2 -1\ 2 ] 

E[\Xi - Yil 2 + \X 2 - Y 2 \ 2 } - 2E[X X ] + 2E[Y l ] - 2E[X 2 ] + 2E[Y 2 ] + 2 



sufficient to discuss the later. 
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Assume that P is a probability measures on [0, 1] x [0, 1] and and P is a prob- 
ability measure on [1,2] x [1,2]. Suppose that the couple X = (X\,X<2) and 
X = (Xi,X 2 ) give the desired Kantorovich- Rubinstein- Wasserstein L 2 — distance. 
If we denote Z = (X\,X 2 ), then 

E[\X - X\ 2 } = E[\X - Z\ 2 } + E[\Z - X\ 2 ]. 

So it is sufficient to find the distribution of Z which is supported in [0, 1] x [1, 2]. 
We assume further the density functions f(x,y) of X and f(x,y) of X are smooth 
and strictly positive on their domains. Denote the marginal densities 

fi(x)= [ f(x,y)dy, f 2 {y) = f f(x,y)dx 

Jo Jo 

and 

h{x) = J f(x,y)dy, f 2 (y) = J f(x,y)dx. 

Furthermore, denote the conditional distributions 

1 f x 1 rv 

J2(y) Jo fi{x) Ji 

and 

F\{ x \y) = T7~T / f( u iV) du i F 2{y\x) = -~— / f{x,u)du, 

f2(y) Jo h(x) J 1 

which are strictly increasing with respect to their first argument so their inverse 
functions with respect to their first arguments exist and denoted as G(l,s,y) = 
F^Hsly), G(2,x,t) = F^Htlx), G(l,s,y) = i^^y) andG(2,x,t) = F 2 (_1) (t|x). 
Without losing generality, we assume that there is a positive constant c > such 
that 

1 -0,(1,,.) >c _L-G V (2,.,.) >c (0.2) 



/ 2 (.) /i(0 

and that all functions appeared in the later equation (jO.lip are sufficiently smooth. 
Our those regularity hypotheses will not affect the generality of our problem, because 
what we will treat later is the unknown distribution function F of Z, which is 
continuous under the weak convergence of the laws of (X, X). Therefore we can 
always use the usual regularizing approximation procedures. 

Denote by 7i the set of all density functions q(x,y) on [0, 1] x [1,2] such that 
h{x) = Ji q(x, y)dy and f 2 (y) = Jq 1 q(x, y)dx. 
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We are looking for a density function p(x, y) satisfying 

1) peW; 

2) p(x, y) = q(x, y) minimizes 
" ''' rs q(u,y) 2 



/ I s — ^C 1 ' / 7 ] x du >v)\ q(s,y)dsdy 
i Jo Jo f 2 {y) 



+ f 1 C It - G(2, x, C ^^-dv)\ 2 q{x, t)dt dx (0.3) 
Jo J i Jo fi{x) 



q(x, v 



For < a < a\ < 1 and 1 < b < b\ < 2 when e is small enough, 
a + e < ai < ai + e < 1, 6 + e < 6 X < &i + e < 2. 

Define 

£(s,t) = ^([a,a+e]x[b,6+e])U([ai,ai+e]x[6i,fei+e])(s,i) 

-^([a.a+elxIbi.bi+eDudai.ai+elx^.h+eDC 5 ^)- (°- 4 ) 

Then p(s, t) + <5£(s, t) € TL when both e, 5 are small. Since p is the minimum, by 

o< - 2 i 2 fis-oii, r p(u ' y) +/f ( "' y) ^,y)i 2 ( P ( S ,^+^,y))&^ 

+^ T [ 2 \t-G(2,x, C P{X} V \ + . *f V) dv)\ 2 (p(x, t) + 8£(x, t))dt dx 
^ Jo Ji Jo fi(.x) 

1 I \s-G(l, [ V ) du, y) \ 2 p(s, y)ds dy 



e 2 Ji Jo ' "'Jo f 2 (y) 

~\ t [ 2 \t-G(2,x, [ t ^^-dv)\ 2 p(x,t)dtdx (0.5) 
t l Jo Ji Jo fi{x) 

e h Jo Jo f 2 (y) 

- rs p(u,y) 



,s-G(l, / ^f^-du,y)\ 2 p{s,y)dsdy] 
l Jo Jo / 2 (y) 

/ 2 |t-G(2,x, ^(^^) + ^,.) 2 ^ 
e Jo Ji Jo 

- /' [ 2 \t-G(2,x, fv^ldv)\ 2 p{x,t)dtdx} 
Jo Ji Jo Ji{x) 
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4( f /' I. - 0(1. f P( "- y) + SI : (a ' y) iu,y)\H i (s,y)a, C i v 

e A Jo Jo f 2 (y) 

+ [ 2 \t-G(2,x, C ^ (X ' V) dv)\ 2 8Z{x, t)dt dx} 

Jo Ji Jo Jl\ x ) 

Letting e — ► 0, we get 

o f ai ff>fi f* P( u > b ±h \ r gW j ^P^iMw 

°- - 2 A (G( U w ,y)_s) * ( H w ' l} m 

J a Jo f 2 (b) Jo f 2 {b) f 2 (b) 

~ 2 l <G(2 ' < "'I ~JKai) y l ' L 7^*° AM* 

+2 /"(G(2,a, /' - ()G;( 2, , /' *S^*&!l« 

A Jo Ji(a) y j o Ji(a) /i(a) 

+ ai-G(l, / - - o-G(l, / - d«,bi) 

Jo / 2 (6i) Jo / 2 (6i) 

-k - f + k> - fid, r*£Sw 

Jo f 2 (b) Jo f 2 (b) 

+ \h - G(2, ai , f bl - \b-G(2, ai , f P -^-d V )? 

Jo hiflv Jo /i(ai) 

-| 6l _ G(2 ,a, /" + |6 - 0(2, a, 

Jo Ji{a) Jo h( a ) 

Multiplying both sides by (ai _ a) 1 (6l _ 6) , letting (a± - a)(bi - b) -> 0, we get 

8 2 



dxdy 

where 



M(x,y)>0 (0.6) 



M(x,y) 

-2 f (G(l, r^cfo.y)-^!, [' rihVl^y)^*, 
Jo Jo f2(y) Jo f 2 (y) f 2 (y) 

n[ V (ntn f* P( x ' v ) a \ 4\r>> in f* P( x > v ) a \P( x ^) j 4 

- 2 k (G( Wi W <fo) ~ ()G '' ( Wi 

+|* - 6(1, f ^liu, 4- |„ - G(2, x, r 

Jo f 2 (y) Ji h{x) 
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f 20- (5(1, I* V^Adu,y))ds 
Jo 1 V 'Jo h(y) ,y " 



io h{y) 

+ f V 2(t-G(2,x, f p -^ldv))dt 
= * 2 + V 2 - 2 f G(l, f ^du, l)ds - 2 [ y G(2, 0, f P -^dv)dt 

JO JO /2(1) J 1 «< 1 JH U J 

-2 f f {[G(l, [ S P^.duM + [Gi2, S , f P -^±dv)}' s }dsdt (0.7) 

j0 JO / 2 (t) •/ 1 

On the other hand, if one replace p + 5£ by p — 5£, the same computation leads 

4 M(i ' !)so (o - 8) 

Thus we deduce from (|0.6|) and (|0.8j) that 
a 2 

M(x, y) = (V < x < 1 < y < 2), 



<9x<9y 

or (V < x < 1< y < 2) 



[G(l, f y)]' v + [G(2, x, ^ P -^-dv)]' x = (0.9) 

Jo f2(y) h in 1 ) 

Denote the probability distribution function F(x,y) = Jq p(s , t) dt ds . Then 

F>{x,y) = A(x) r^TT^' KM = [ Xp jTTdu, 
Ji fi{x) y Jo j2{y) 

(|0.9p becomes 

[G(l, yi-F^x,^,^]; + [G(2,x, -^-^(x,!/))]^ = (0.10) 
/2(y) Aw 

or 

G,(l, * ^(x,y),y)^F^(x^ 

= -g,(i, jrr F y(. x > y)'V)~ x > F - (x ' y)) 



+G,(1, y|-^(x, S,),^^!, y) 

h{y) ' J2\y> 



+Gy {2,x,-^—F' x {x,y))^\F' x {x : y) (0.11) 
Ji\ x ) Ji\ x ) 
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which is a quasi-linear elliptic equation with unknown F(x,y), which satisfies the 
uniform ellipticity condition (|0.2p . Since its diffusion coefficients only contain the 
first order partial derivatives, under minus regularity condition, the solution of the 
Dirichlet boundary problem (\/x € [0,1], My € [1,2]) : 

F(0,y) = 0, F(x,l)=0, F(x,2) = F fi(s)ds, F{l,y) = F f 2 (t)dt 

Jo Jo 

has a unique solution ([IB] p. 264). 



Furthermore, if we plug (|0.9j) into (|0.7p . then 

M(x, y) = x 2 + y 2 -2 F (5(1, F ^^du, l)ds 

Jo Jo / 2 (1) 

- 2 l" G{2 '°-l! P -m dv)dt (ai2> 

That is, M(x, y) can be written as a closed-form solution which depends only the 
initial values p(0, .) and p(., 0) 
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